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Abstract 

In this paper, we introduce Gabor shearlets, a variant of shearlet systems, which are based on a different group 
representation than previous shearlet constructions: they combine elements from Gabor and wavelet frames 
in their construction. As a consequence, they can be implemented with standard filters from wavelet theory 
in combination with standard Gabor windows. Unlike the usual shearlets, the new construction can achieve 
a redundancy as close to one as desired. Our construction follows the general strategy for shearlets. First we 
define group-based Gabor shearlets and then modify them to a cone-adapted version. In combination with 
Meyer filters, the cone-adapted Gabor shearlets constitute a tight frame and provide low-redundancy sparse 
approximations of the common model class of anisotropic features which are cartoon-like functions. 
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1. Introduction 

During the last 10 years, directional representation systems such as curvelets and shearlets were intro- 
duced to accommodate the need for sparse approximations of anisotropic features in multivariate data. These 
anisotropic features, such as singularities on lower dimensional embedded manifolds, called for representa- 
tion systems to sparsely approximate such data. Prominent examples in the 2-dimensional setting are edge- 
like structures in images in the regime of explicitly given data and shock fronts in transport equations in the 
regime of implicitly given data. Because of their isotropic nature, wavelets are not as well adapted to this task 



as curvelets yfl, contourlets |6|], or shearlets UM- Recently, a general framework for directional representa- 
tion systems based on parabolic scaling - a scaling adapted to the fact that the regularity of the singularity in 
the considered model is C 2 - was introduced in |8{| seeking to provide a comprehensive viewpoint towards 
sparse approximations of cartoon-like functions. 

Each system comes with its own advantages and disadvantages. Shearlet systems distinguished them- 
selves by the fact that these systems are available as compactly supported systems - which is desirable for 
applications requiring high spatial localization such as PDE solvers - and also provide a unified treatment of 
the continuum and digital setting thereby ensuring faithful implementations. Shearlets were introduced in Jgl 
with the early theory focussing on band-limited shearlets, see e.g. Hill . Later, a compactly supported variant 



was introduced in 11711 . which again provides optimally sparse approximations of cartoon-like functions [19]. 



In contrast to those properties, contourlets do not provide optimally sparse approximations and curvelets are 
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neither compactly supported nor do they treat the continuum and digital realm uniformly due to the fact that 
they are based on rotation in contrast to shearing. 

1.1. Key Problem 

One major problem - which might even be considered a "holy grail" of the area of geometric multiscale 
analysis - is whether a system can be designed which is 

(PI) an orthonormal basis, 
(P2) compactly supported, 
(P3) possesses a multiresolution structure, 

(P4) and provides optimally sparse approximations of cartoon-like functions. 

Focussing from now on entirely on shearlets, we can observe that bandlimited shearlets satisfy (P4) while 
replacing (PI) with being a tight frame. Compactly supported shearlets accommodate (P2) and (P4), and 
form a frame with controllable frame bounds as a substitute for (PI). We are still far from being able to 
construct a system satisfying all those properties - also by going beyond shearlets - , and it is not even clear 
whether this is at all possible, cf. also lllql . Several further attempts were already made in the past. In 1I20I1 . 
shearlet systems were introduced based on a subdivision scheme, which naturally leads to (P2) and (P3), but 
not (PI) - not even being tight - and (P4). In lfl2Tl . a different multiresolution approach was utilized leading 
to systems which satisfy (P2) and (P3), but not (P4), and (PI) only by forming a tight frame without results 
on their redundancy. 

1.2. What are Gabor Shearlets ? 

The main idea of the present construction is to use a deformation of the group operation that common 
shearlet systems are based upon together with a decomposition in the frequency domain to ensure an almost 
uniform treatment of different directions, while modeling the systems as closely as possible after the one- 
dimensional multiresolution analysis (MRA) wavelets. To be more precise, the new group operation includes 
shears and chirp modulations which satisfy the well-studied Weyl-Heisenberg commutation relations. Thus, 
the shear part naturally leads us to Gabor frame constructions instead of an alternative viewpoint in which 
shears enter in composite dilations [1 1 Ofl - The filters appearing in this construction can be chosen as the 
trigonometric polynomials belonging to standard wavelets or to M-band versions of them, or as the smooth 
filters associated with Meyer's construction. To achieve the optimal approximation rate for cartoon-like 
functions, we use a cone adaptation procedure. But in contrast to other constructions, we avoid incorporating 
redundancy in this step. 

It is interesting to notice that due to the different group structure, Gabor shearlets do not fall into the frame- 
work of parabolic molecules (cf. |8|]) although they are based on parabolic scaling. Thus, this framework can 
not be used in our situation for deriving results on sparse approximations by transfering such properties from 
other systems. 

1.3. Our Contributions 

Gabor shearlets satisfy the following properties, related to Subsection ll.il 

(PI*) Gabor shearlets can be chosen to be unit norm and '-tight, where b~ l - which can be interpreted as 
the redundancy (cf. Subsection l2.41 i - can be chosen arbitrarily close to one. 

(P2*) Gabor shearlets are not compactly supported, but can be constructed with polynomial decay in the spa- 
tial domain. 

(P3) The two-scale relation for the shearlet subband decomposition is implemented with standard filters re- 
lated to MRA wavelets. 

(P4) In conjunction with a cone-adaptation strategy and Meyer filters, Gabor shearlets provide optimally 
sparse approximations of cartoon-like functions. 

Thus, (P3) and (P4) are satisfied. (PI) is approximately satisfied in the sense that the systems with property 
(PI*) are arbitrarily close to being orthonormal bases. And (P2) is also approximately satisfied by replacing 
compact support by polynomial decay in (P2*). It is in this sense that we believe the development of Gabor 
shearlets contributes to introducing a system satisfying (P1)-(P4). Or - if it could be proven that those are 
not simultaneously satisfiable - providing a close approximation to those. 
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1.4. Outline of the Paper 

The remainder of this paper is organized as follows. In Section [2] we set the notation and recall the 
essential properties of Gabor systems, wavelets, and shearlets which are needed in the sequel. In this section, 
we also briefly introduce the notion of redundancy first advocated in (J], In Section[3] after providing some 
intuition on our approach we introduce Gabor shearlets based on a group related to chirp modulations and 
discuss their frame properties and the associated multiresolution structure. The projection of those Gabor 
shearlets on cones in the frequency domain is then the focus of Section|4] again starting with the construction 
followed by a discussion of similar properties as before. The last section, Section[5] contains the analysis of 
sparse approximation properties of cone-adapted Gabor shearlets. 



2. Revisited: Wavelets, Shearlets, and Gabor Systems 

In this section, we introduce the main notation of this paper, state the basic definitions of Gabor systems, 
wavelets, and shearlets, and also recall the underlying construction principles, formulated in such a way 
that Gabor shearlets will become a relatively straightforward generalization. We emphasize that this is not 
an introduction to Gabor and wavelet theory, and we expect the reader to have some background knowledge, 
otherwise we refer to [0] or [ 21 1 . A good general reference for most of the material presented in this section is 



the book by Weiss and Hernandez 112 3H . In the last part of this section, we discuss the viewpoint of redundancy 
from 111], which we adopt in this paper. 

In what follows, the Fourier transform of / e L l (W) is defined to be /(£) := JT f(x)e^ 2mx ^dx, where 
x ■ g is the dot product between x and £ in R". As usual, we extend this integral transform to the unitary 
map / i-> / defined for any function / which is square integrable. The unitarity is captured in the Plancherel 
identity (f,g) = </,g) for any two functions f,g e L 2 (R") with (f,g) := j Rn f(x)g(x)dx. 

2.1. MRA Wavelets 

Let {<f>, iff] be a pair of a scaling function and a wavelet for L 2 (R) associated with a pair of a low-pass filter 
H : T — > C and a high-pass filter G : T — » C, for convenience defined on the torus T = {z e C : \z\ = 1}. We 
start by recalling the Smith-Barnwell condition for filters. 

Definition 2.1. A filter H : T — > C satisfies the Smith-Barnwell condition, if 



\H(z)\ 2 + \H(-z)\ 2 = 1 



for almost every z € T. 



The Smith-Barnwell condition is an essential ingredient in the characterization of localized multireso- 
lution analyses; that is, the scaling functions (f> are localized in the sense of having faster than polynomial 
decay: ^(1 + x 2 )"\(f>(x)\ 2 dx < oo for all n e N. 



Theorem 2.1 (Cohen, as in UM Theorem 4.23 of Chapter 7). A C°° function H : T -> C is the low-pass 



filter of a localized multiresolution analysis with scaling function <f> given by 

oo 

7=1 

if and only if H(\) = 1, H satisfies the Smith-Barnwell condition, and there exists a set K C T which contains 
1 and has a finite complement in T such that H(z 2 J ) + Ofor all j E Z, j > , and z 6 K. 

The two-scale relations for <p and iff are conveniently expressed in the frequency domain, 
$(2Z)=H{e- 7 * i t)m and ^(2£) = Gfe- 2 **)^, a.e. f e R. 
The orthonormality of the integer translates of {<p, iff) is captured in the matrix identity 



M(z)M(z)* = h with M(z):= 



H(z) H(-z) 
G(z) G(-z) 



for a.e. z € T . 
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Often, only H is specified and the matrix has to be completed to a unitary, with a common choice being 
G{z) = -zHFz). 

The low-pass filter of the Meyer scaling function is of particular use for the construction of Gabor shear- 
lets, which will be shown in Section|5]to yield optimal sparse approximations. The Meyer scaling function <p 
and wavelet function if/ are given by 



i 



if|£l<|, 
cos(f v(3|f| - 1)) if| 
otherwise 



- 3 <\f\< I 



and 



- e -^sin[fv(3|£|-l)] if \<\S\<\ 

- e -^ cos || v( ||^|_l)j if | < £| < 4 

otherwise. 



Here, v is a function satisfying y(x) = for x < 0, y(x) = 1 for x > 1, and in addition, v(x) + v(l - x) = 1 for 
< x < 1 . For example, v can be defined to be v(x) = x 4 (35 - 84x + 70x 2 - 20x 3 ) for x 6 [0, 1], which leads 
to C 3 functions of and i^. 

Then, the corresponding H is given by 



1 



lfl< 



cos(fv(6|£| - 1)) ±<|f|< 




5 < If I < 5- 



We remark that £ h> H(e- 2 " ii: ) is a 1 -periodic function and the Meyer wavelet function \ji defined above 

satisfies tA(2£) = -e- 2ni ? H(e- 2ni( ^^jcp(^). Hence the high-pass filter G for <A is given by G(z) = -zH(-z) 
with z = e~ 2m ^ . For any teM, there exists v such that <p and i/f are functions in C k (K). Moreover, v can be 
constructed to be C°° so that both cf> and if/ are functions in C°°(]R) and their corresponding filters are functions 
in C°°(T). For more details about Meyer wavelets, we refer to Q4J| or 1121 1. 



2.1.1. Subband Decomposition for Discrete Data 

The two-scale relation in combination with downsampling as a simple data reduction strategy is crucial 
for the efficient decomposition of data from some approximation space, say Vq. We next formalize the 
decomposition of a function / e Vq = V-\ ffi W-\ in terms of the Z-transform. 

For this, let the group of integer translations {T„}„ e z acting on L (K) be defined by T„f(x) = f(x - n) for 
almost every x e K. Then, each function / 6 Vo can be expressed as 

This enables us to associate with / the values of the almost everywhere converging series 

oo 

Zf(z) = J] c " z "> z e T ' 

«=-oo 

Letting now H : T — » C be the low-pass filter of a localized multiresolution analysis as specified above, 
the characterization of the subspace V-i c Vq can then be expressed as 

/eV_, <=» Zf(z)=H(z)(H(dZf(z)+H(=z)Zf(~zj) fora.e. zeT. 
This fact enables us to state a unified characterization of V-\ and of W_i = Vo 9 V_i . 

Proposition 2.1. Let Py t and Pw_ t denote the orthogonal projection ofVo onto V_i and W-i, respectively. 
Further, letting H be defined as above, define H + to be the multiplication operator given by H+F(z) — 
H(z)F(z), H given by H-F(z) — H(—z)F(z), and R2 the reflection operator satisfying RiZfiz) — Zf{—z). 
Then, we have 

ZP V _J = H + {I + R 2 )ir + Zf and ZP W _J = 7T(I-R 2 )H-Zf. 
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Proof. We first observe that the composition of down and up-sampling sets every other coefficient in the 
expansion of / to zero. After applying Z this amounts to a periodization. 
By definition, the projection onto V-\ satisfies 

ZP v _J(z) = H(z) (WzjZfiz) + HFz)Zf(-z)) = H + (I + RiWlZfd) . 
Similarly, the projection onto W-i is 

ZP w _J(z) = G(z) (G(z)Zf(z) + G(=zjZf(-z)) 

= -zHFz)(-zH(-z)Zf(z) + zH(z)Zf(-z)) 



= H(-z)(H(-z)Zf(z) - H(z)Zf(-z)) 
= Hl(I-R2)H-Zf(z). 

The proposition is proved. □ 

The relevance of these identities lies in the fact that (7 + R2)H + Zf is an even function whereas (7 - 
Ri)H-Zf is odd. Hence knowing every other coefficient in the series expansion is sufficient to determine the 
projection onto the corresponding subband. Thus, in this case downsampling reduces the data without loss of 
information. 

2.7.2. M -Band Wavelets 

If instead of a dilation factor of 2 in the two-scale relation, a factor of M is used, M-1 wavelets are 
necessary to complement the translates of <p to an orthonormal basis of the next higher resolution level. 
In this situation, it is an M x M matrix which has to satisfy the orthogonality identity. Generalizing the 
consideration in the previous subsection, let a> = e~ 2m/M and RmZ/(z) = Zf(ojz) and the scaling mask Hq for 
<p satisfy Z/io 1 \Ho(^z)\ 2 = 1. We then define the orthogonal projection onto V-\ in terms of the transform 



(M-\ 



ZP V _J = H 



2X 



H Zf. 



For a proof that fV , is indeed a projection, see the more general statement in the next theorem. 

We complement the filter Hq by finding H„ such that (H„(a> 1 z)) M l} is unitary for almost every z e T. 
Once the wavelet masks Hi, I = 1, ... ,M - 1 are constructed by matrix extension, the wavelet functions 
tff { , I = 1, . . . , M - 1 are given by ~tyc{M£) = H e {e' 2 ^ ), £ e R, I = 1, . . . , M - 1. It is well-known that 
then {if/{ : £ = \, . . . ,M — 1} generates an orthonormal wavelet basis for L 2 (W). 

One goal in M-band wavelet design is to choose Hq and then to complete the matrix so that the filters 
H„ impart desirable properties on the associated scaling function and wavelets. In fact, one can construct 
orthonormal scaling functions for any dilation factor M > 2 and the matrix extension technique applies 
for any dilation factor M > 2. When M > 2, the orthonormal bases can be built to be with symmetry 



(1111141 U5D). 

In the same terminology as Proposition ^. II we now have the following result that identifies the orthogonal 
projections belonging to M-band wavelets. 

Theorem 2.2. Let {H,,}^ 1 be such that {H n {<jj 1 z))„ Z. * s un ^ al J f or almost every z e T, and let {Pw-uh^q 
be the operators defined by 

I'M -l \ 



2X 



H c Zf. 



Then {Pw_, f }/ln are mutua Hy orthogonal projections (note that Pw_,,Q = Pv^)- 

Proof. We first observe that by the assumed unitarity, every row is normalized, and each pair of rows is 
mutually orthogonal, i.e., 

M-1 



V H,(Lo e z)H m (a> c z) = 5„,m , 



1=0 
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where d n<m — Win-m and 6 n<m — otherwise. 

Next, we show that each fV , f is an orthogonal projection. To begin with, we see that t is Hermitian 
because the sum 2>o ^m ^ s anc ' tn i s P ro P er ty is retained when it is conjugated by the multiplication operator 
H( . The fact that each iV , e is idempotent and that the projections are mutually orthogonal is due to the 
commutation relation 

RmH{ = H[(u -)R M 

and because of the orthogonality of the rows in (H,„( fa/z))^. We have for / € L 2 (R) and almost every 

Z e T, 

M-l M-l 

ZPw^Pw^f(z) = H e Y J R' M H~eH k J] R" M H~ k Zf(z) 

M-l M-l 

= H ( Y J H e (cj'"z)H k (cj"'z)RZ J] R" M H~ k Zf(z) 

m=0 «=0 
M-l M-l 

= H e H e (af'z)H k (cj'"z) ^ R n M ~H k Zf{z) 

m=0 n=0 
M-l 

= 6 u H e J] R" M H~ k Zf(z) = 6 a ZP w _ u f(z) . 

ji=0 

This finishes the proof. □ 

2.2. From Group-Based to Cone-Adapted Shearlets 

In contrast to wavelets, shearlet systems are based on three operations: scaling, translation, and shearing; 
the last one to change the orientation of those anisotropic functions. Letting the (parabolic) scaling matrix Aj 
be defined by 



and the shearing matrix S k be 



1 I 1 



Then, for some generator tf> e L 2 (R 2 ), the group-based shearlet system is defined by 

{Tiil/(S k Aj ■ -m) : j, k e 1, m e Z 2 }. 

Despite the nice mathematical properties - this system can be regarded as arising from a representation 
of a locally compact group, the shearlet group - group-based shearlet systems suffer from the fact that they 
are biased towards one axis which prevents a uniform treatment of directions. Cone-adapted shearlet sys- 
tems circumvent this problem, by utilizing a particular splitting of the frequency domain into a vertical and 
horizontal part. For this, we set A } \ :- Aj, S h k := S k , 

Given a scaling function (f> e L 2 (M 2 ) and some if/ e L 2 (R 2 ), the cone-adapted shearlet system is defined by 

{(/>(■- m) : m e Z 2 } U {2 3jV V(S^ ■ ~m) : j>0, \k\ < 2 j , m e Z 2 } 
U tfi'^iSlA) ■ -m) :j>0, \k\ < 2 j , m e Z 2 ), 



where ifr(x\ , xi) = >f( x i > x i)- For more details on shearlets, we refer to 118 1 . 

Gabor shearlets will also be constructed first as group based systems, and then in a cone-adapted version. 
However, in contrast to other constructions, we aim at low redundancy in the group-based system and avoid 
increasing it in the cone adaptation. 
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2.3. Gabor Frames 

Like the previous systems, Gabor systems are based on translation and modulation. As usual, we denote 
the modulations on L 2 (R) by M m f(tf) = e 2ni "^f(^). 

By definition of tightness, a square-integrable function w : R — > C is the window of a b~ l -tight Gabor 
frame, if it is unit norm and for all / e L 2 (R), 

\\ff=b J] \<f' M mbTnW)\ 2 . 

For more details on Gabor systems, we refer the reader to [Q. 

Various ways to construct such a window function w are known. We recall a construction of a b~ l -tight 
Gabor frame with b~ l > 1 arbitrarily close to 1 []5j] . 

Example 2.1. Let v be in C°°(R) and v(x) = for x < 0, v(x) = 1 for x > 1 and v(l - x) + v(x) = 1. 
Let w(x) := (v((l/2 + e — \x\)/2e)) 1 ^ 2 , x € R. 77ien, if is easy fo smow f«af w is a smooth function with 
support belonging to [—1/2 — s, 1/2 + e]/or any < e < 1/2, ||w|| 2 = 1 ana" |r„w| 2 = 1. Consequently, if 
b — (1 + 2e)~', fnen {M m £r„w : m, n e Z} defines a b~ ] -tight Gabor frame. 

2.4. Redundancy 

Since we cannot achieve (PI), but would like to approximate this property, besides the classical frame 
definition, we also require a notion of redundancy. The first more refined definition of redundancy besides 
the classical "number of elements divided by the dimension" definition was introduced in Jl|]. The extension 
of this definition to the infinitely dimensional case can be found in fl. Since this work is not intended for 
publication, we make this subsection self-contained. 

We start by recalling a redundancy function, which provides a means to measure the concentration of the 
frame close to one vector. If {v?,}, e / is a frame for a real or complex Hilbert space 'H without any zero vectors, 
and let § = {x e <H : ||x|| = 1}, then for each x e S, the associated redundancy function % : § — > R + U {oo} is 
defined by 

iel 

Taking the supremum or the infimum over x in this definition gives rise to the so-called upper and lower 
redundancy, which is in fact the upper and lower frame bound of the associated normalized frame, 

n + = sup^O) and <RT = inf ^(x). 

xeS A ' eS 

For those values, it was proven in [Q]] that in the finite-dimensional situation, the upper redundancy provides 
a means to measure the minimal number of linearly independent sets, and the lower redundancy is related to 
the maximal number of spanning sets, thereby linking analytic to algebraic properties. 

It is immediate to see that an orthonormal basis satisfies 'Kr — ft^ = 1, and a unit norm A-tight frame 
9?" = - A. This motivates the following definition. 

Definition 2.2. A frame {y,-}/ =e / for a real or complex Hilbert space has a uniform redundancy, ifRr = ?i + , 
and if it is unit norm and A-tight, then we say that it has redundancy A. 

In the sequel, we will use the redundancy to determine to which extent (PI) is satisfied. 

3. Group-Based Gabor Shearlets 

Let us start with an informal description of the construction of Gabor shearlets in a special case with the 
goal to first provide some intuition for the reader. 

Generally speaking, the shearlet construction in this paper is a Meyer-type modification of a multires- 
olution analysis based on the Shannon shearlet scaling function Oo,o,o = Xk, where K — e R 2 : |£i| < 
1 and |^2 /£i I ^ 1/2}- For an illustration, we refer to FigureJT] 
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Figure 1: Support of the scaling function belonging to group-based Shannon shearlets (as well as the group-based Gabor shearlets) in 
the frequency domain. Additional lines indicate the boundaries of the support for sheared scaling functions. 

It is straightforward to verify that chirp modulations 

$o,o, m (£) = XK {g)e^^e nim ^\ m = (m u m 2 ) e Z 2 , 
define an orthonormal system {Oo,o,m : w £ ^ 2 }, while the use of the usual modulations 

gives a 2-tight frame {Yoo jm : m 6 Z 2 } for its span. The same is true when the modulations are augmented 
with shears, fl>o,*,m(£) = ^o,o,m(^i > £2 _ ) an d likewise for To,^ m , in order to form the orthonormal or tight 
systems {<S>o,k,m ■ k e Z,m e l}\ or {To,*,,,, : k e Z, m G Z 2 }, respectively. Because both systems are 
unit-norm, the tightness constant is a good measure for redundancy as detailed in Subsection l2.4l indicating 
that chirp modulations are preferable from this point of view. Incorporating parabolic scaling preserves those 
properties. 

In a second step (Section|4]i the strategy of shearlets is followed to derive a cone-adapted version, which 
provides the property of a uniform treatment of directions necessary for optimal sparse approximation results. 
A further necessary ingredient for optimal sparsity are good decay properties. We show that a combination 
of Gabor frames, Meyer wavelets and a change of coordinates provides smooth alternatives for the charac- 
teristic function, yet with still near-orthonormal shearlet systems that are similar to the Shannon shearlet we 
described. 

3.1. Construction using Chirp Modulations 

To begin the shearlet construction, we examine an alternative group of translations acting as chirp mod- 
ulations in the frequency domain. These modulations do not correspond to the usual Euclidean translations, 
but for implementations in the frequency domain this is not essential. In the following, we use the notation 
K* :=R\{0}. 
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Definition 3.1. Let y^) := (yi(0, 71(f)) with ri (f) := = IsgntfOf 2 W y 2 (f) := | /or f = 

(fi.fi) el'xl Wfe define the two-dimensional chirp-modulations [Xp : /? e M. 2 } by 

We emphasize that the set with f 1 = is excluded from the domain, which does not cause problems since 
it has measure zero. 

Next, notice that the point transformation y has a Jacobian of magnitude one and is a bijection on W X R. 
Therefore, it defines a unitary operator T according to 

r/(f) = /(y(f)), feR'xM. 

As discussed in Subsection |2.21 the shear operator is a further ingredient of shearlet systems. By abuse of 
notation, for any s e R, we will also regard S s as an operator, that is 

s s m,&) = 7^1,^2-^1). 

The benefit of choosing the chirp-modulations is that shearing and modulation satisfy the well-known 
Weyl-Heisenberg commutation relations. The proof of the following result is a straightforward calculation, 
hence we omit it. 

Proposition 3.1. For s e E and /} e R 2 , 

S S X P = e- 2nihs X p S s . 

The last ingredient is a scaling operator which gives parabolic scaling. Again abusing notation, we write 
the dilation operator with Aj. For j e Z, we let Aj be the dilation operator acting on / e L 2 (R 2 ) by 

Ajf(g lt &) = 2- 3 -'' /2 /(2- 2 ^ 1 ,2-^ 2 ) 

for almost very f = (fi.fi) G R 2 - 

The last ingredient to define group-based Gabor shearlets are the generating functions to which those 
three operators are then applied. For this, let <p be an orthogonal scaling function of a 16-band multiresolution 
analysis in L 2 (R), with associated orthonormal wavelets \\jj(}\ 5 =v and let w be the unit norm window function 
of a 1 -tight Gabor frame {M^bTkW : mi, k e Z) for L 2 (R). Then we define the generators 

O^:=F0®w and ^00 : ~ ^ c ® w > ^ = 1, - . . , 15. 

in L 2 (R 2 ) = L 2 (R) ® L 2 (R), based on which we now define group-based Gabor shearlets. 

Definition 3.2. Let <t>o,o,o and ^qqq, ( — 1, . . . , 15, and w be defined as above. Let jo e Z. Then the 
group-based Gabor shearlet system is defined by 

ggS jo (<t>, m)l,;w) := {<i> joAm : £ e Z, m e Z 2 } U [¥ jkm :j,ke Z, j > j , m e Z 2 , t = 1, . . . , 15} c L 2 (R 2 ), 
where 

®j,kjn& = A jX( miyln2 b)S ^0,0,0 

= 2- 3 ^ 2 0(2- 4 Vi(^))w(2^2(^) - k)e 2nimTiiy ^ ) e lnimih2 ' yi ^\ 

and 

V ''j.i.„,*^' = AjXfa jn 2 b)S ^0,0,0 

= 2- 3j/2 ^r(2- 4j yi(t)M2 j y2(t) - k)e 2nimTiiy ^ ) e 2nimib2i ^ ) . 

The particular choice of dilation factors in the first and second coordinate comes from the need for 
parabolic scaling and integer dilations. The motivation is that the regularity of the singularity in the cartoon- 
like model is C 2 , and if the generator satisfies width = length 2 one can basically linearize the curve inside 
the support with controllable error by the Taylor expansion. Since we utilize a different group operation, it 
is not immediately clear which scaling leads to the size constraints width = length 2 . An integer value of j 
requires Aj — j 2 , so j = 4. Then one considers the intertwining relationship between the dilation operator A4 
and the standard one-dimensional dyadic dilation D to deduce A4F = FIT 16 ®£) 4 , which explains the choice 
of M = 16 bands. 
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3.2. MRA Structure 

One crucial question is whether the just introduced system is associated with an MRA structure. As a first 
step, we define associated scaling and wavelet spaces. 

Definition 3.3. Let <&j,k,m and m , j, k e Z, m e Z 2 , 1 — 1, . . . , 15 be defined as in Definition \3.2\ For each 
j € Z, the scaling space Vj is the closed subspace 

Vj = span{(D jAm :keZ,me Z 2 } c L 2 (K 2 ), 

and the associated wavelet space Wj is defined by 

Wj = spanf^,,, :teZ,meZ 2 ,M,..., 15). 

Next, we establish that the group-based Gabor shearlet system is indeed associated with an MRA struc- 
ture, and analyze how close it is to being an orthonormal basis. 

Theorem 3.1. Let <&j,k,m and T.^ m , j,k e Z,m e Z 2 ,^ = 1 15 be defined as in Definition ^. 2\ and let 

{VjljeZ an d {Wj}j £ z be the associated scaling and wavelet spaces as defined in Definition \3.3\ Then, for each 
j E Z, the family {<$> j t k,m '■ k £ Z,/n e Z 2 } is a unit norm b^ 1 -tight frame for Vj, and \^ C j km ■ k e Z,m 6 
Z 2 , 1 — 1, . . . , 15} forms a unit-norm b~ x -tight frame for Wj. 

Proof. We first verify that the scaling function generates a 1 -tight frame for a closed subspace of L 2 (R 2 ). 
By Proposition |3.1| the operator F intertwines shears and translations in the second component, 

s k rm = rm,ti -Hx) = 7(n(^.r2© - k) . 

Moreover, it intertwines chirp modulations with standard modulations. The overall dilation is irrelevant 
because Aj is unitary, so we can set 7 = for simplicity. Therefore, it is enough to prove that {M mi (p ® 
Mm.bTkw] defines a & -1 -tight frame for r~ l (Vj). This follows from the fact that w is the unit norm window 
function of a b^ 1 -tight Gabor frame and from (f> being an orthonormal scaling function. Since the subspaces 
<g> L 2 (W), \f/ t ® L 2 (W), I - 1, . . . , 15 are mutually orthogonal, and the functions [(j>, : I = 1, . . . , 15} satisfy 
a two-scale relation of an MRA with dilation factor M = 16 in L 2 (M), the claim follows. □ 

Theorem 3.2. The scaling and wavelet subspaces Vo and Wo as defined in Definition \3.3\ satisfv the two-scale 
relation 

V e W = A 4 V . 

Proof. We note that the functions 

4> ® w and i[ic®w,{ = 1 , . . . , 15 

are orthogonal by assumption, and the orthogonality remains under the usual modulations in the first com- 
ponent. On the other hand, the window function in the second component forms a tight Gabor frame under 
translations and modulations, so each of the tensor products generates a tight frame for its span. 

Since the subspaces 0®L 2 (K), ifrc ®L 2 (R), 6 = 1, . . . , 15 are mutually orthogonal, and the functions {<p, ij/c] 
satisfy a two-scale relation of an MRA with dilation factor M = 16 in L 2 (K.), the claim follows. □ 

Since implementations only concern a finite number of scales, the following result becomes important. It 
is an easy consequence of Theorem l3.ll 

Corollary 3.1. The group-based Gabor shearlet system QQSj a ($, {t^f}^; w) as defined in Definition \3.2\ for 
any jo E Z, or the system { x Pj km ■ j,k € Z,m 6 Z 2 ,^ = 1, . . . , 15} forms a unit-norm b~ l -tight frame for 
L 2 (M 2 ), and consequently it has uniform redundancy fir —H + — b~ l . 
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4. Cone-Adapted Gabor Shearlets 



The construction of nearly orthonormal cone-adapted Gabor shearlets is based on complementing a core 
subspace Vo which has the usual MRA properties for L 2 (R 2 ) under scaling with a dilation factor of 16. The 
isometric embedding of Vo in V\ proceeds in 3 steps: 

1 . Vi is split into a direct sum of two coarse-directional subspaces, Vj 1 and V\, corresponding to horizon- 
tally and vertically aligned details, respectively. 

2. Each of these two coarse-directional subspaces is split into a direct sum of high and low pass compo- 
nents. The low-pass subspaces Vi and combine to Vo = VjJ ffi VL 

3. The high pass components are further split into subspaces with a finer directional resolution obtained 
from shearing. 

The first step in the process of constructing the cone-adapted shearlets is a splitting between features that 
are mostly aligned in the horizontal or in the vertical direction. The shearlets then refine this coarse splitting. 

4.1. Cone adaptation 

In addition to filters which restrict to cones in the frequency domain, we introduce quarter rotations for 
the splitting of horizontal and vertical features. This enables us to define two mutually orthogonal closed 
subspaces containing functions with support near the usual cones for horizontal and vertical components. As 
in the case of wavelets, the main goal of this construction is that the smoothness of a function in the frequency 
domain is not substantially degraded by the projection onto the subspaces. 

Again, we use standard filters from wavelets in our construction. For this, we define a version of the 
Cayley transform = j^f, which maps £ e R to the unit circle T = {z e C : |z| = 1}. The inverse map is 
defined on T \ {-1}, ( _1 (z) = ij^. We use the map £ to lift polynomial filters on T to rational filters on R. 

Lemma 4.1. Let H : T -> C satisfy \H(z)\ 2 + \H(-z)\ 2 = 1 for all z e T, then H(£) := H(^)) is a function 
on R which satisfies 

|#(f)| 2 + |#(-l/£| 2 = l. 

Proof. The Cayley transform intertwines the reflection £ h-> on R with the reflection about the origin, 
because 

1 + Z/£ -1 + 2£ 

Thus, the property of H is a direct consequence of this coordinate transformation. □ 

We observe that if H(z) has N-l vanishing derivatives at z = -1, H(-l) = H'(-l) = ■■■ = H (JV_1) (-1) = 
0, then H(%) decays as ij~ N at infinity. 

Definition 4.1. Let H : T -> C satisfy the Smith-Barnwell condition \H(z)\ 2 + \H(-z)\ 2 = 1 for all z e T. 

Its associated filter operators H + , H + , H and H , are defined to be the multiplicative operators with the 
Fourier transform of any f € L 2 (R 2 ) in the frequency domain according to H+f(i;) = H(((^2/i;\))f(0 and 
H-f(£) — H(-((^2l ^\))f(S), the overbar denoting multiplication with the complex conjugate. We denote R 
to be the rotation operator on L 2 (R 2 ) given by Rf {^\,^2) — f (%2, ~^\)- 

This allows us to introduce a pair of complementary orthogonal projections, which split the group based 
Gabor shearlets into a vertical and a horizontal part to balance the treatment of directions. The design of these 
projections is inspired by the description of smooth projections in 12311 . 

We start the construction with isometries associated with the vertical and horizontal cone, which we 
denote by C v and C/,, respectively. By the set inclusion, L 2 (C V ) and L 2 (C/,) naturally embed isometrically in 
L 2 (R 2 ). We denote these embeddings by i v : i v /(£) = /(f) if f € C v and i,,/(f) = otherwise and similarly 
for £/j. We wish to find isometries that do not create discontinuities. 
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Theorem 4.1. Let H : T -> C iafii/y |H(z)| 2 + \H(-z)\ 2 = 1 /or all z e T, and let H+, H+, H and 
H be defined as in Definition \4.1\ Let C v — {x e M 2 : [jc^I > |xi|} and Ch — R 2 \ C v , then the map 
H v : L 2 (C V ) -> L 2 (R 2 ) given fey 



H v/ = *_(/- — — ^) tv/ . 
is an isometry, and so is the map H/, : Lr(Ci,) — > L 2 (R 2 ), 



Ha/ = H + (/ + 



1 + i 



-/? + 



1-i. 



-^) t/l /. 



2 2 

Moreover, the range o/H v is f/ze orthogonal complement of the range ofE/, in L 2 (R 2 ). 

Proof. We begin by showing that H v and E/, are isometries. The space L 2 (C V ) splits into even and odd 
functions. After embedding in L 2 (R 2 ) these functions then satisfy R 2 t v f = t,,/ or /? 2 i,,/ = -i v f, respectively. 
By the definition of R, the operator I - - ^R? maps even t,/ to 



I- 



1 + i 



-R - 



i-'' 3 



1 . 1 ^ 2 1 



which implies that it is an eigenvector of R, R(\l + \R 2 - \R - \R 3 )hf = + \R 2 ~ \ R ~ \ R ^)hf and 
for odd / 



1 + i 
I R 



1 1 



fl J \ h f = \-I--R<--R+-R> \ h f 



which gives R(\l - \R 2 - {R + ±R 3 )i v f = i(±7 - ±# 2 - {R + ^ 3 )i v /. 

Similarly, the operator (7 + ^R + ^-R 3 ) maps the even functions into functions that are invariant under 
R, whereas the odd functions give eigenvectors of R corresponding to eigenvalue -i. We verify that for even 

t-hf, 



1+i 
7+ R 



1-i, 



1 1,1 



1 



Ry h f = {-i + -R< + -R + -Ry hf . 

Hence we get the eigenvalue equation R( jl + \R 2 +\R + \R 3 )ihf = (\l + \R 2 + \R + \R 3 )ihf- Analogously, 
for odd /, 



1 + 



1 +i 



1 -i 



/? J I t j/ = I \i - V + Lr- Lr 3 I L h f 



-R + 



which yields R(±I - ±# 2 + ±R - ^R 3 )i h f = (-i)(jl - jR 2 + jR~ jR 3 )ihf. 



Since R is unitary, the eigenvector equations imply that the orthogonality between even and odd functions 

^ 3 )or(7- ±gR- ifi 



is preserved by the embedding followed by the symmetrization with (I + ^R + or (/ - ^R - ^R 3 ). 



Thus, the identity 



1 + i 
I R 



L 2 (K 2 ) 



2||/|| 2 2(Ci) for all/ e L 2 (C„) 



can be verified by checking it separately for even and odd functions. Next, multiplying by H and using that 
|/f_| 2 + R^\H-\ 2 R = |7L.| 2 + \H+\ 2 = 1 gives by the orthogonality of Ri v f and i v f the isometry 



— / 1 + i 
H II R 



1-i 



R* )hf 



\\H-i v ff m 



= W-t v f\\ 2 L2m 



H- 



1+i 1 - i , 
—R + —R%f 



L 2 (M 2 ) 



1+i 1-i i . 

-^- + ^-* 2 k/ 



L 2 c 



ll/Z-iv/ll 2 ,^, + IHWIlL 
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The same proof applies to Ej. 

To show that the ranges are orthogonal complements of each other, we define the orthogonal projections 
P v = H V H* and Pi, = Hj,Ef . We first establish that these projections have the more convenient expressions 

/ l+i \-i \ — — / l+i 1- i \ 

P A = ff + / + — 7? + — R 3 \H + and P v = #_/-—£- —R 3 #_ . 

To this end, we note that if M v is the multiplication operator M,,/(£) = Xc,f(%) with ;tc r the characteristic 
function of C v , and similarly for M/„ M/,/(£) =^Cf,(f)> then by definition 

1+/ 1 - i , l + i 1 - i , — 

P,, = E A E* = //+(/ + — Z? + —R 3 )M h (I + —R + —R 3 )H + . 

We simplify this expression using that M h R = RM V , M v + M h - I and R 2 M h = M h R 2 , which gives the 
identities 

l+i 1 - i , 1 + z 1 - z , 1 + z 1 - z , l+i 1 - i q 1 + z 1 - z , 

( P+ R 3 )M h +M h ( P+ R 3 ) = ( P+ R)M h +( R+ R 3 )M r = R+ R 3 

v 2 2 ; n /a 2 2 2 2 2 2 2 2 

and 

1 + Z 1 — Z q 1 + Z 1 — Z' q 1 + Z 1 — Z q , 

( R + R 3 )M h ( R + R 3 ) = ( R + R 3 ) 2 M V = M v . 

v 2 2 2 2 2 2 

Inserting this in the expression for Pi, results in 

l+i 1 - i q l + i 1 — i q — 

H + (I + —R + —R 3 )M,,(I + —R + —R 3 )H + 

= H + (M h + {-^-R + ^Y~R 3 ) + M V )K = H + (I + ^-R + l -^!-R 3 )H;. 

The identities for P r are completely analogous. 

Finally, we show that the two orthogonal projections are complementary. To this end, we use 

— — l + i 1 - i , 
Pi, = H + H + + H + H-(—R + ~ R ) 

and 

— — l + i 1 - i q 

P v = H H - H-H + ( R + R 3 ) 

2 2 

which gives the identity after elementary cancellations and H + H+ + H H = /. Since Pi, is by definition an 
orthogonal projection, P v = I - Pi, is the complementary one. Thus, the ranges of H/, and H v , or equivalently, 
the ranges of Pi, and P v , are orthogonal complements in L 2 (R 2 ). □ 

For later use, we denote the range spaces of S/, and H v by 

L 2 h (R 2 ) = S A (L 2 (M 2 )) and L 2 ,(R 2 ) = S,,(L 2 (E 2 )) , 

which are orthogonal complements in L 2 (R 2 ). 

Under the isometries H v or S/,, a unit norm tight frame for L 2 (C V ) or L 2 (Ci,) is mapped to a unit norm 
tight frame for Lj^M. 2 ) or L 2 (R 2 ). The consequence of this is that we only need to construct shearlets for 
the horizontal and vertical cones, not for all of M 2 . If the shearlets have smoothness and the appropriate 
periodicity, then we retain smoothness under the symmetrization. 

Corollary 4.1. Let g e L 2 (C V ) be a function which is continuous in C v and even, then S v g is continuous on 
M. 2 . If in addition there is h : R — » C such that g 6 L 2 (C V ) satisfies 

and h is in C k (M. X R*) and 2-periodic in its first component, then S v g is k times differentiable in R 2 \ {0}. 
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Proof. By the2-periodicity, g(ft,ft) = fe(l,ft7|ft|) = ft(-l,ft7lftl) = S(ft,~ft). Thus, continuity of h 
ensures that of E v g. A similar argument holds for differentiability. □ 

This implies that we only require a resolution of the identity on [—1,1] with an appropriate version of 
Gabor frames. We refer to a result of S0ndergaard from 1^21 . 

Theorem 4.2 ( 1I22I0 . Let No £ N, a — 2/No and choose t £ N, r < iVo- Ifw is a function in the Feichtinger 
algebra, and if {M mT f{TkavA is an tight Gabor frame for L 2 (R), then the periodization w°, 

w"(g) = w(f - 2«)/or a.e. £ £ K , 

neZ 

defines an —tight Gabor frame {M mT/2 T ka w° : < k < N - l,m e 7L\for L 2 ([-l, 1]). 

Of particular interest to us is the following corollary, which we can draw from this result. 

Corollary 4.2. The uniform redundancy 'R~ = = -r of the Gabor frame {M mT /2Ti ca w° : < k < 
No — l,m e Z] defined in Theorem \4.2\ can be chosen as close to one as desired by choosing No,T £ N 
sufficiently large. 

We remark that tightness is preserved when periodizing the window, and if its support is sufficiently small 
then so is the norm. 

Before stating the definition of cone-adapted Gabor shearlets, we require the following additional ingre- 
dients. We consider the change of variables (ft, ft) ^ Y(0 = (yiCft.T?©)' 1 e C 1 > v )> defined by 

= isgn(6)& y£(ft = f and yftfl = ±sgn(&)& y 2 Xft = f 1 - 

We let T/, and T v denote the associated unitary operators, r/,/(ft = f{y h {^)) and r v /(ft = /(y l '(ft). For each 
orientation v or h, we define the appropriate dilation, shear, and modulation operators by 

A')=A h X h m =X m , andS h k =S k , 

and if/(ft,ft)=?(ft,ft),then 

A}/(ft,ft) = A*?(ft,ft), X*/(ft,ft) = X*?(ft,ft), and S£/(ft,ft) = S£?(ft,ft). 

Definition 4.2. Lef foe an orthogonal scaling function of a \6-band multiresolution analysis in L 2 (R), with 
associated orthonormal wavelets {>]/( : t — 1, . . . , 15}, and let No, T £ N such that w is the unit norm window 
function of an ^--tight Gabor frame {M m2T /2T2k/N w '■ m 2,k £ 11\ for L 2 (W), with the periodization w° as 
described in Theorem \4.2\ Let jo £ Z. Then the associated cone-adapted Gabor shearlet system is defined by 

cgs k {ci>,mtv™) : = {*5,Am.*U» : k 6 Z >\ k/N °\ * 2 ' _1 ' m 6 z2] 

U^Km' T u.„, : /.* e z > J * Jo, \k/No\ < 2 j -\m e Z 2 , € = 1, . . . , 15} c L 2 (R 2 ), 

where 

j,k,m 11 J {mx.miTll) zfe/JVo " r .MVfi J (Wi,m2T/2) 2k/7Vo 1 ' 

a«c/ accordingly 

=E h A h i X* m .j'.rrffStv and ^/f = H^JKL, m W9^«« W 

For an illustration of the support of the special case of cone-adapted Shannon shearlets and the more 
general cone-adapted Gabor shearlets, we refer to Figure 
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Figure 2: (a) Support of the cone-adapted Shannon shearlet scaling functions in the frequency domain, in horizontal and vertical orien- 
tations; (b) Support of a cone-adapted Gabor shearlet scaling function in the frequency domain, corresponding to a Gabor frame with 
No = 4. The smallest achievable redundancy with No = 4 is obtained by setting r = 3, resulting in Nq/t = 4/3. With sufficiently large 
values of No, and the implicit finer directional resolution, the choice r = No - 1 allows the redundancy to get as close to one as desired. 



4.2. MRA Structure 

By classical results from frame theory, without restriction of the parameters the system consisting of the 
functions <J>'! , , <t> 1 ! , forms a tight frame. 

y.K.m' j,k,m & 

Theorem 4.3. Let No e 2N, a — 2 /No and t e N, and let w be the unit norm window function of an ^--tight 
Gabor frame {M miT j2TkaW : m%, Ic € Z}for L 2 (M.), such that the periodization w° is a unit-norm Gabor frame 
{M miT i2T] la w°}for L 2 ([— 1, 1]), then the system C@Sj (</>, {i/J[}\ S = {, w)for any jo € Z, or the system 

W'jin' Y J : L : J> ke Z > WW * 2j ~ l < m 6 Z 2 ,€ = 1, . . . , 15} 

is a unit-norm ^-tight frame for L 2 (R 2 ), with redundancy 
Proof. The family 

K , < I| , m2 r/2)Sa/«„ r '.^ ®w:j,ke Z, \k/N \ < 2 J -\m e Z 2 ; {= 1, . . . , 15} 
is an ^--tight frame for L 2 (Ch). Consequently, under the isometry, 

{*£ = M^^U* ® w ° ■ J> k 6 z ' * 2J '~ l > m e Z 2 ,^ = 1, . . . , 15} 

is an fi-tight frame for P,,(L 2 (IR 2 )). Similarly, 

| V I'L = ^) x lm um ^l m h T$t ®w°:j,ke Z, \k/N \ < V~\m e Z 2 , 1= 1, . . . , 15} 
is a ^-tight frame for P,,(L 2 (R 2 )). By the orthogonality of the ranges for P/, and P v , the union 

t*Km' T jln, : J> ke Z ' WW * 2hl <™ € Z 2 , ^ = 1, . . . , 15} 

is an ^--tight frame for L,2(M?). The proof for the case of CQSj a {(/>, (iM^ ; w) is similar. □ 
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Corollary 14. 2| implies that the redundancy can be chosen arbitrarily close to one. 

Corollary 4.3. The redundancy ^ of the preceding Gabor shearlet system can be chosen arbitrarily close 
to one. 

5. Optimal Sparse Approximations 

In this section, we show that under certain assumptions the cone-adapted Gabor shearlet system CQSj {(f>, {^i)\ 5 =i ; w) 
as defined in Definition 



provides optimally sparse approximation of cartoon-like functions, similar to 



k2 

'classical' shearlets (see 111*11 HI). Due to the asymptotic nature of the optimally approximation results, which 
involve only with shearlets with large scale j, without loss of generality, we consider C@Sj a (<p, {lAf}'^; w) 
with jo = and denote the system as CQS^V 1 ' , V P). We will first state the main result and the core proof in 
the following subsection, and postpone the very technical parts of the proof to later subsections. 

5.7. Main Result 

We first require the definition of cartoon-like functions. For this, we recall that in & 2 {A) denotes the 
set of cartoon-like functions f, which are C 2 functions away from a C 2 edge singularity: f = fo +f\XB, where 
/o,/i 6 C 2 ([0, l] 2 ) and \\f\y := £m<2 \Wf\U < 1 with d" = d\'d l 2 2 being the 2D differential operator with 
order d\ = j^, 82 = and v = (vi, V2). More precisely, in polar coordinates, let p(ff) : [0, 2n) i-» [0, l] 2 be 
a radius function satisfying sup e \p"{6)\ < A and p < po < 1. The set B c R 2 is given by B — {x e [0, l] 2 : 
1Mb ^ p(0)}- In particular, the boundary dB is given by the curve in M 2 : B{9) = (p(6) cos 9,p(9) sin 9). 

Utilizing this notion, we can now formulate out main result concerning optimal sparse approximation of 
such cartoon-like functions by our cone-adapted Gabor shearlet system as follows. 

Theorem 5.1. Let f e S 2 (A) and fy be the N-term approximation of f from the N largest cone-adapted 
Gabor shearlet coefficients {(f,^^) : ^ e C@SQV h , x ¥ v )} in magnitude. Then 

\\f-f N \\l <c-AT 2 -(logA0 3 . 

To prove this theorem, we follow the main idea as in J3, IT ]. In a nutshell, we first use a smooth partition 



of unity that decomposes a cartoon-like function / into small dyadic cubes of size about 2~ J x 2~ J . If j is large 
enough, then there are only two types of dyadic cubes: one intersects with the singularity of the function, 
namely, the edge fragements, and the other only contains the smooth region of the function. We then analyze 
the decay property of the shearlet coefficients. Eventually, by combining the decay estimation of each dyadic 
cube, we can prove Theorem l5.ll 

Though the main steps are similar to J2I EH, we however would like to point out that some of the key 
steps require slightly technical extensions of results in Jl, IT]. For the results available in J3, IT ], we simply 



state them here without proof for the purpose of readability. 

Let us next state some necessary auxiliary results, including Theorem 15.21 for the decay estimate with 
respect to those edge fragements and Theorem l5.3l for the decay estimate with respect to those smooth regions. 

An edge fragement (see Figure |3) is of the form 

f(x U X 2 ) = W Q (2 J X 1 ,2 J X2)g(x l ,X 2 )l{ Xl >E(x 2 )}, (1) 

where wo,g are smooth functions supported on [-1, l] 2 and |/i"0t)l < A. 

Let Qj be the collection of dyadic cubes of the form Q = [m\ jV, (m\ + l)/2-'] x \mil2 s (m 2 + l)/2-']. For 
wo a nonnegative C°° function with support in [-1, l] 2 , we can define a smooth partition of unity 

Y^w Q (x) = l, ieR 2 

QeQ, 

with wq = wo&xi - m\,2 J X2 - mi). If Q e Qj intersects with the curve singularity, then /g := fwQ is an 
edge fragment. 

Let Q°j be the collection of those dyadic cubes Q e Qj such that the edge singularity intersects with the 
support of wq. Then the cardinality 

\&j\ < c ■ V. (2) 
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X2 



fE(x 2 ) 



Figure 3: An edge fragment. 



2 • 2~' 



Similarly, Q . := Qj\Qj are those cubes that do not intersect with the edge singularity. We have 

\Q)\ < c ■ 2 2j + 4 ■ 2 j . 



(3) 



Let {fy} be a sequence. We define \s„\(N) to be the Mh largest entry of the {1^1). The weak-t p quasi-norm 



\ w {p of {s^} is defined to be 



fylU? := sup(A 1/p ■ kjalov)). 



which is equivalent to 



N>0 



Ws/iWwep - sup(#{// : |s M | > e) ■ e p ) 

\e>0 



I//' 



We abbreviate indices for elements in CQSi^ 11 , X V V ) and write 4^ with yU = (/ fc, m; t, £). The index set at scale 
j is Aj := {// = (;, fc, m;ij):ke Z, |&/iVol < 2^ 1 , m e Z 2 ; £ = 1, . . . , 15, t = ft, v}. 

Now similar to 111 ll Theorem 1.3], we have the following result which provides a decay estimate of the 
coefficients with respect to those Q e <3 ( j. 

Theorem 5.2. Let f e £> 2 (A) and /g := fwQ. For Q e Q® with j > fixed, the sequence of coefficients 
{</ 2 ,^>:yue A ; } obeys 

/or some constant c independent of Q and j. 

Similarly, for the smooth part, we can show that the sequence of coefficients {(/q,^) : p e Ay} with 
Q e <3j obeys the following estimate (c.f. fill Theorem 1.4]). 

Theorem 5.3. Let f e B 2 (A). For Q e <2j. with j > fixed, the sequence of coefficients {(/q,^) : // e Ay} 
obeys 

IK/fi.^lU/s <c-2-^ 

/or some constant independent of Q and j. 

The proofs of Theorems 15.21 and 15.31 are very technical and require extension of results in JKH. We 
therefore postpone their detailed proofs to the next two subsections. As a consequence of Theorem 15. 2 1 and 
Theorem l5.3l it is easy to show the following result. 

Corollary 5.1. Let f e £ 2 (A) and for j > 0, let Sj(f) be the sequence of Sj(f) = {(/, T^) : p. e Ay}. Then 

\\Sj(f)\\ w fiP < c 
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Proof. By the triangle inequality, 



2/3 

2/3 



<c-|Q°|-2^' + c-|Q]|-2- 2 ^ 
< c. 

□ 

Now, we can give the decay rate of our cone-adapted Gabor shearlet coefficients as follows. 

Theorem 5.4. Let f e & 2 (A) and s(f) := {(/, ^) : ^ E C0SQ¥ h , ¥")} fee f/ie cone-adapted Gabor shearlet 
coefficients associated with f. Let {\s(f)\(N) '■ N = 1, 2, . . .} be the sorted sequence of the absolute values of 
s(f) in descending order. Then 

sup k(/)| w <c-Ar 3/2 -(logA0 3/2 . 

Proof. From Definition^ we have ^ = E h g". km with g% m = A^^S^r*^ ® w°. Then, 

= «+(/ + l -^R + ^^)4 f , m (^i^2) 



1 + i h,e , e t n , 1 ~ ' w 



+ H(a&/fi))(— s %^2, -£i) + — ^(-f2,fi» 

=: gi + g2- 

For analyzing the optimal sparsity, we first consider ® h f m — gu which can be rewritten as follows: 
with 

oWtf)) ■= M2-*ty(£)yw°(2Jy*(g) - ^) (4) 

and 

ejAAO) ■= 2- 3 J' 2 e 2nim ' 2 ' i '^e^^. (5) 

For simplicity, we use again the compact notation ©/,(£) := cr^(y 1 (^))e ;m (y(^)) with fi = (j, k, m; i, €) e Aj. 
The index set Ay at scale j is as before. 
By Corollary 15. II we have 

R(j, e) := e A,- : \{f, ® M >| >e}<c- e y \ 

Also, 

l</, © M >l < c • 2- 3 ^ 2 . 
Therefore, R(j, e) = for y > § log 2 (e _1 ). Thus 

#lA< : l</> ^>l > < R U> e) ~ c • e ~ 2/3 ' lo g2(^')^ 

Repeating the steps for the second term in the definition of x ^^ m shows that we can replace by 

at the cost of a change of the constant c. This can be seen from the fact that the term ^g h f m (£2> + 

^Ygj'ljJ,— £z» fi) is supported in the vertical cone and thus g2 can be viewed as composed of two quarter- 
rotated elements of the form of gi. The same strategy applies to the vertical cone elements. The theorem is 
proved. □ 
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Now we can prove Theorem |5. 1 l using the above results. 



Proof of Theorem \5.1\ fy = zZftei N (f< ^iiY^ii where 1^ is the set of indices corresponding to the N largest 
entry of {|(/, : p.}. By the tight frame property and Theorem 15 .41 we have 

11/ - Ml 2 < J] \s(f)\f m < c ■ J] AT 3 log(A0 3 < c ■ N~ 2 ■ log(^V) 3 . 



n>N 



This finishes the proof of the theorem. 



□ 



5.2. Analysis of the Edge Fragments 

We shall focus on proving Theorem l5.2l next. To that end, we need some auxiliary results first. From lUll 
Theorem 2.2] or J^, Theorem 6.1], we have the following result, which gives the estimate of the decay of the 
edge fragment in the Fourier domain along a fixed direction. 

Theorem 5.5. Let f be an edge fragment as defined in (HJ and Ij :— [2 2j ~ a , 2 2j+ ' 3 ] with a e {0, 1, 2, 3, 4} and 
fj E {0, 1,2). Then, 

\f(A cos 9, A sin 0)\ z dA < c ■ T" 1 ■ (1 + 2 ! \ sin 0|)" 



J i.i 



Use Theorem l5.5l one can prove the following result (c.f. Ill U Proposition 2.1]). 
Corollary 5.2. Let f be an edge fragment as defined in (Q~|). Then 



f i/©iV-,(/©)i 2 ^ < c ■ 2- 3 \i + \k\y 

Jm 2 h 



Note that although cr C j k {y h {^)) might not be compactly supported compared to 111 ll Proposition 2.1], it 

does not affect the result here, since proofs related to the support of o~ l . k {y h {^)) can be passed through its 
essential support and the estimate outside the essential supported is absorbed in the constant c. For elements 
in the vertical cone 0"^(y v (^)), similarly to the above result, one can show that the decay estimate is of order 
less than 2'^(\ + \k\Y 5 . 

From ifTll Corollary 2.4] or ^ Corollary 6.6], we have the following result about the decay of the deriva- 
tive of the edge fragment in the Fourier domain along a fixed direction. 

Corollary 5.3. Let f be an edge fragment as defined in ([T]) and v — (vi, vi)-Then 



f 

J\A\eIj 



\d v f(A cos 6, A sin 8)\ 2 dA < c v ■ 2 



-2/M 



-2jvi 



-4j 



(1 +2 ; |sin<9|r 5 +c v -2 



■2JM 



-Wj 



We also need the following lemma (see (Uj, Lemma 2.5]), which follows from a direct computation. 
Lemma 5.1. Let cr^(-y' ! (£)) be given as above. Then, for each v = (vi, V2) 6 N 2 , vi, v% e {0, 1,2), 

\d v c/. k (y h (^))\<c v -2- ( - 2v ^-(l + \k\y\ 
where \v\ — Vi +V2 and c v is independent of j and k. 

Use the above results, we can prove the following result, which is an extension of Proposition 2.3] 
and can be proved with similar approach. 

Corollary 5.4. Let f be an edge fragment defined as in ©, <f e . k (y h (g)) be defined as above, and L, be the 
differential operator defined by 



L t = 



tl- 



2 2 J 



2n{\ + \k\) 



2 \ 

^2 



where t > is a fixed constant. Then 

f \l, (mc/j^mf dA < c, ■ 2- 3 >(l + \k\y 
for some positive constant c, independent of j and k. 
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Now we are ready to prove Theorem 15. 2 1 
Proof of Theorem \5 .2\ Fix j > 0, for simplicity, let / = fg be the edge fragment as in ([TJ. We have 



We have 



= (2 m -m I 2- 4 ''sgn(^ 1 )^ 1 - 2^^2^|)^, m (/©). 

2 ^ 



Also, 



^ e;>! (/©) = (2^m 1 2- 4j sgn(^ 1 ) + 2tt/--2 j+1 ^^(/(fl) 

+ (2 m -m 1 2- 4 -'sgn(^ 1 )^ 1 - Ini^V^fe 

2 £ 2 



Let L, be the differential operator defined in Corollary 15.41 Then, 



with 



f + 



(fisgn(^) + 5^^) + - ^^) 2 



(1 + |*|)2 



2 ft 



Let W),* be the essential support of tJ^Cy*©) defined as 



0L 6») : 2 2 V < U| < 2 2 V, arctan(2" J '( 1)) < 6 < arctan(2- 7 '(— + 1)) \ 

N N J 



(6) 



For £ e Wjjc, we have |ft | ~ 2 1 and one can show that a 2 J tan « A:. Consequently, we can choose large 
f > independent of y, £, m such that 



sup (ft| > c ■ 



1 + 



(mi - mik) 2 



[l + ra 9 ,] =: c • Gk{m) 



(1 + \k\f 

for some positive constant c independent of j, k, and m. For Gk(m), we have 



(7) 



GfcOn) = 



((1 +m 2 )(l + w 2 ) 



for = 



1 + 



d+l*t) 2 /l*P 



[l+m 2 ] for/t^O 



Consequently, 



W(ft4*(/(ft)) ■ L-\ ej , m (y h (mdt 



f 



WCft^Cr*©)) 
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Fork + and m :- (m\,m2) e Z 2 , define R,-„ := {m = (m\,mj) e Z 2 : j e \ih\,ih\ + l),m,2 = mo}. Since 



for £ fixed, {ej tm (y h ({-)) : £ 6 Z 2 } is an orthonormal basis for Lr functions supported on Wjjc, we obtain 

2 i</,^)i 2 < r 



< 



few, 
< c 



(/©)) 



[(1 + {milk- m 2 ) 



^— — 2-— \ \L,(f(t)o- e 

>n 2 ) 2 )(l +m 2 )] 2 J K 2 I * 



(/(£))) df. 



By Corollary 15. 41 we have 



X l</-^)l 2 ^c-G3 2 . 2" v (l + |*D" 



with Gfj, := (1 + (mi - »i2) 2 )(l + For k — 0, similarly, we have G„~, = (1 + ra 2 )(l + mj). 

Let Nj^^(e) :- #{m e R,j, : \(f, *P„)| > e}. Then Njj,f„(e) < c ■ (1 + |fc|) and the above inequality implies 

:-2 . 9-3/ . c -2 



Thus, 



which implies 



Njxihie) <c-G^- 2~ >] ■ e- L ■ (1 + \k\f\ 
N iXrn {e) < c • min(l + |jfc|, G, 2 ■ 2~ 3j ■ e 2 ■ (1 + |£|)- 5 ), 



X Nj, Kih (e)<c-Gf 3 -2-J-e 



2/3 



/t=-2J 



Since 2mez 2 ^r^ 3 < °°> by above inequality, we obtain 



#{p e Mj : Y M >| > e} < J] £ AT iiMl (e) < c ■ 2~ j e 



2/3 



meZ 2 *=-2> 



which is equivalent to the conclusion that 



\m,%)\i^ <c-2-^ 2 . 



□ 



5.3. Analysis of the Smooth Region 

Now, we shall focus on proving Theorem 15.31 Let us provide some lemmas first. From Q Lemma 8.1] 
or 111 ll Lemma 2.6], we have 

Lemma 5.2. Let f = gWQ, where g e B 2 (A) and Q e (2j. Then 

f \f(f)\ 2 df < c ■ 2-'% 

where Wj t k is the essential support of a c - jk (j h (^)) as in 

From [ll|, Lemma 2.7] we have 
Lemma 5.3. for v = (vi, V2) 6 N 2 , 

v 

Yj |3v^(/©)| 2 < c -2- 2 i^. 

k=-V 
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Using the above two lemmas, one can easily prove the following result, which is an extension of fTH 
Lemma 2.8] and can be proved by a similar approach. 



Lemma 5.4. Let f = gWQ, where g e £ 2 (A) and Q e (3j. Define the differential operator L t :— (tl — A) 
with t > and A = d 2 , + d 2 . Then, 



^ 2 \L 2 t (mo-]jtfm\dt<c t -T 



10J 



k=-V 

for some positive constant c, independent of j. 

Now we are ready to prove Theorem 15. 3 1 
Proof of Theorem \5.3\ Let / = /g = gWQ and L, as defined in Lemma |5~4l We have 

L t (e p „(y"(0) = g"lM)ejAJ\OX 

where 

mi2- 2 ^sgn(^) + ^2 3 > +1 | m T , , 

^ = [, + 121-1 1 + 2 2 ^ 1 2^sgn(^ 1 - ^|) 2 

+ (^)% m (/©) 



Similar argument to the proof of Theorem 15. 21 we can choose t > large enough so that 

sup \g%(&\ > c ■ [1 + 2~ 2 Vi - m 2 kf + ml)]. 

Form := (mi, #12) e Z 2 , defined, := {m = {m\,m2) e Z 2 : 2~ 2 i(m\ - mjK) e [mi, mi + Y),mi = mi). Observe 
that for each m, there are only 1 + 2 2j choices for m\ in R r „. Hence #R m < 1 + 2 2j . Again, similar argument 
to the proof of Theorem 15. 21 we have 



2 \(m\ 2 <c- sup — 1— f \L^)o-e. k (y\m\ 2 df 

— — — 1 2 ^ 14 f |L 2 (/(^)^(7(i)))| 2 ^. 

[1+2 2 ^(mi - m 2 fcr + mf)] 4 J R 2 1 



< c 

Then by Lemma l5~4l 



£ 2 l(/,%>l 2 <c-G: 4 - f £ [l?C/X0oyA0))| 2 # 



< c ■ r; • 2 "". 



where G,~ n := 1 + m 2 + ih\. 
Using the Holder inequality 



^k„|"< ^k„| 2 N l -<" 2 , l/2<p<2. 



m=l Vm=I ' 

Since the cardinality of is bounded by 1 + 2 2 \ we have 



2 Z K/> V * c ' (2 2J ) 1 - p/2 ■ G~ 2p ■ 2- 5 "j 
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Moreover, since p > 1/2, Xmez, 2 G^ p < oo. Consequently, 

J] \(f, x V fl )\ p < c ■ 2 2iil - pl2) - 5pi = c ■ 2 2j(l - 3p) . 

In particular 



□ 



References 

[1] B. G. Bodmann, P. G. Casazza, and G. Kutyniok, A quantitative notion of redundancy for finite frames, 
Appl. Comput. Harmon. Anal. 30 (201 1), 348-362. 

[2] J. Cahill, P. G. Casazza, and A. Heinecke, A quantitative notion of redundancy for infinite frames, 
preprint. 

[3] E. J. Candes and D. L. Donoho, New tight frames of curvelets and optimal representations of objects 
with C 2 singularities, Comm. Pure Appl. Math. 56 (2004), 219-266. 

[4] I. Daubechies, Ten Lectures on Wavelets. CBMS-NSF Regional Conference Series in Applied Mathe- 
matics, 61, SIAM, Philadelphia, PA, 1992. 

[5] I. Daubechies, A. Grossmann and Y. Meyer, Painless nonorthogonal expansions, J. Math. Phys. 27 
(1986), 1271-1283. 

[6] M. N. Do and M. Vetterli, The contourlet transform: An efficient directional multiresolution image 
representation, IEEE Trans. Image Process. 14 (2005), 2091-2106. 

[7] K. Grochenig, Foundations of Time-Frequency Analysis, Birkhauser, Boston, 2001. 

[8] P. Grohs and G. Kutyniok, Parabolic molecules, preprint. 

[9] K. Guo, G. Kutyniok, and D. Labate, Sparse multidimensional representations using anisotropic dila- 
tion and shear operators, Wavelets and Splines (Athens, GA, 2005), Nashboro Press, Nashville, TN 
(2006), 189-201. 

[10] K. Guo, D. Labate, W. Lim, G. Weiss, and E. Wilson, Wavelets with composite dilations and their MRA 
properties, Appl. Comput. Harmon. Anal. 20 (2006), 231-249. 

[11] K. Guo and D. Labate, Optimally sparse multidimensional representation using shearlets, SIAM J. 
Math. Anal. 39 (2007), 298-318. 

[12] B. Han, G. Kutyniok, and Z. Shen, A unitary extension principle for Shearlet Systems, SIAM J. Numer. 
Anal. 49 (2011), 1921-1946. 

[13] B. Han, S. Kwon and X. Zhuang, Generalized interpolating refinable function vectors, J. Comput. Appl. 
Math. 227 (2009), 254-270. 

[14] B. Han and X. Zhuang, Matrix extension with symmetry and its application to symmetric orthonormal 
multiwavelets, SIAM J. Math. Anal. 42 (2010), 2297-2317. 

[15] B. Han and X. Zhuang, Algorithms for matrix extension and orthogonal wavelet filter banks over alge- 
braic number fields, Math. Comput. 82 (2013), 459-490. 

[16] R. Houska, The nonexistence of shearlet scaling functions, Appl. Comput. Harmon. Anal. 32 (2012), 
28-44. 



23 



[17] P. Kittipoom, G. Kutyniok, and W.-Q Lim, Construction of compactly supported shearlet frames, Con- 
str. Approx. 35 (2012), 21-72. 

[18] G. Kutyniok and D. Labate, Shearlets: Multiscale Analysis for Multivariate Data, Birkhauser, Boston, 
2012. 

[19] G. Kutyniok and W.-Q Lim, Compactly supported shearlets are optimally sparse, J. Approx. Theory 
163 (2011), 1564-1589. 

[20] G. Kutyniok and T. Sauer, Adaptive directional subdivision schemes and shearlet multiresolution anal- 
ysis, SIAM J. Math. Anal. 41 (2009), 1436-1471. 

[21] S. Mallat, A Wavelet Tour of Signal Processing, Academic Press, San Diego, 1998. 

[22] P. L. S0ndergaard, Gabor frames by sampling and periodization, Adv. Comput. Math. 27 (2007), 355- 
373. 

[23] G. Weiss and E. Hernandez, A First Course on Wavelets, CRC Press, Boca Raton, 1996. 



24 



